Validation API core #8348

zhiltsov-max · 2024-08-26T16:02:23Z

Motivation and context

Depends on #8272
Depends on #8321

Added server API for creation of a GT job on task creation
Added server support for task creation with GT pool (aka Honeypot)
Added new GT job frame selection method random_per_job, which guarantees each annotation job gets the specified GT overlap, making each annotation job validatable
Added new GT job frame count selection options based on task size % and segment size %
Changed GT job creation parameter "frames" to accept relative frame ids instead of absolute (source data) ones
Allowed frame deletion in GT jobs. Deleted GT frames are considered excluded from validation, so should not appear in quality reports. Frame removal from a simple GT job (in tasks without honeypots) doesn't remove task frames, only the GT job frames.

Server API changes:

GET /api/tasks/{id}/ got a new validation_mode field, reflecting the current validation configuration (immutable)
POST /api/tasks/{id}/data got a new validation_params field, which allow to enable GT / GT_POOL validation for a task on its creation

Tasks with Honeypots

This validation mode affects task creation, so can only be used in task creation. It cannot be disabled or changed after the task is created. When honeypots are configured, each job in the task gets several extra validation frames.
The pool of available frames and the number of validation frames per job are specified by the user at task creation.

Limitations:

This validation mode can only be used with random frame ordering.
Inherently, this assumes that job_frame_mapping and overlap cannot be used in such tasks.
Track annotations are prohibited in tasks with honeypots enabled.

Honeypot frames and GT annotations are accessible via the GT job, as in the case with regular GT jobs. However, unlike regular tasks with GT jobs, task annotation import affects the GT job as well in tasks with honeypots. Task annotation export contains only GT annotations on validation frames (so, only the GT copy of validation frames is included).

How has this been tested?

Checklist

I submit my changes into the develop branch
I have created a changelog fragment
I have updated the documentation accordingly
I have added tests to cover my changes
I have linked related issues (see GitHub docs)
I have increased versions of npm packages if it is necessary
(cvat-canvas,
cvat-core,
cvat-data and
cvat-ui)

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.

Summary by CodeRabbit

New Features
- Introduced a server setting to disable media chunks on the local filesystem, enhancing configurability.
- Added tracking for the last assignee update date in quality reports, improving task management.
- Enhanced job chunk identifiers for better clarity and uniqueness.
Bug Fixes
- Resolved memory management issues and refined job assignment logic in video processing.
Documentation
- Updated API schema with new enhancements related to job management and validation processes.
Chores
- Updated package dependencies and added new configuration settings for Redis in the Helm chart.

…sk creation (put chunk creation into the end), need to update chunk generation approach to to per job

…iterator

…matches

cvat/apps/engine/views.py

tests/python/rest_api/test_jobs.py

cvat/apps/dataset_manager/task.py

cvat/apps/dataset_manager/util.py

cvat/apps/engine/backup.py

zhiltsov-max · 2024-09-27T10:48:43Z

@klakhov ,

There is a corner case when Im trying to create a task with 100% honeypots. So it gives me the task with just 1 gt job and 0 regular jobs. Is there a usecase for that? maybe we shoud prohibit such case?

Hm, I think yes, it should produce a validation error during task creation.

I don't see any changes in .rego. Will non-admin users be able to compute quality reports after this pr?

I think it will be ether in a separate PR or in the PR with allocation reports, it seems to be the most relevant place.

cvat/apps/dataset_manager/task.py

zhiltsov-max · 2024-09-27T16:41:01Z

@coderabbitai review

coderabbitai · 2024-09-27T16:41:07Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

klakhov · 2024-09-30T08:43:49Z

cvat/apps/dataset_manager/task.py

+ if data.tracks and db_data.validation.mode == models.ValidationMode.GT_POOL:
+ # Only tags and shapes can be used in tasks with GT pool
+ raise ValidationError("Tracks are not supported when task validation mode is {}".format(
+ models.ValidationMode.GT_POOL
+ ))


So, as far as I understand we cant use tracks in tasks with HP.
But when I try to add a track in UI I get this error:

Seems error message is wrong

Thanks, fixed

@zhiltsov-max What is the nature of such restriction?
I understand that tracks with interpolation do not make sense in honeypot jobs, but is it the only reason?

Frames are in the random order, tracks make no sense in such case. If the frames were ordered, honeypots could be (a) clearly identifiable or (b) could not be reused multiple times. Simple GT job doesn't have such a problem, so it can be used if frame ordering or tracks are required in the task.

It makes sense if each frame of the track is a keyframe.

Just do not understand the reason why we forced this restriction.
If a user does it, why not. It is the user's decision.

…th honeypots

…dation-core

sonarcloud · 2024-09-30T15:14:53Z

Quality Gate passed

Issues
28 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
2.0% Duplication on New Code

See analysis details on SonarCloud

zhiltsov-max added 30 commits July 9, 2024 12:30

Add basic implementation for task creation with honeypots

a6bc9e2

Update api schema

dbb39fd

Implement job creation, add job honeypot update endpoint, refactor ta…

675c160

…sk creation (put chunk creation into the end), need to update chunk generation approach to to per job

Update chunk and manifest creation

9a09510

Fix segment size application

011d5ca

Fix honeypot updating in a job

19e4fb2

t

77dea65

Update frame provider and media cache

b32a9eb

t

cb4ff93

t

d49233c

Support static chunk building, fix av memory leak, add caching media …

146a896

…iterator

Refactor static chunk generation - extract function, revise threading

52d1bac

Refactor and fix task chunk creation from segment chunks, any storage

0c53436

Fix chunk number validation

c166123

Enable formatting for updated components

630c97e

Remove the checksum field

8d710e7

Be consistent about returned task chunk types (allow video chunks)

654a827

Support iterator input in video chunk writing

12e5f2a

Fix type annotation

a79a681

Refactor video reader memory leak fix, add to reader with manifest

d5118a2

Disable threading in video reading in frame provider

1b429cf

Fix keyframe search

d512312

Return frames as generator in dynamic chunk creation

167ee12

Update chunk requests in UI

88a9cb2

Update cache indices in FrameDecoder, enable video play

30bf8fd

Fix frame retrieval for video

ee3c905

Fix frame reading in updated dynamic cache building

dc03220

Fix invalid frame quality

4bb8a74

Fix video reading in media_extractors - exception handling, frame mis…

f7d2c4c

…matches

Allow disabling static chunks, add seamless switching

34d9ca0

zhiltsov-max commented Sep 27, 2024

View reviewed changes

cvat/apps/engine/views.py Show resolved Hide resolved

zhiltsov-max commented Sep 27, 2024

View reviewed changes

tests/python/rest_api/test_jobs.py Outdated Show resolved Hide resolved

azhavoro reviewed Sep 27, 2024

View reviewed changes

cvat/apps/dataset_manager/task.py Outdated Show resolved Hide resolved

cvat/apps/dataset_manager/task.py Outdated Show resolved Hide resolved

cvat/apps/dataset_manager/util.py Outdated Show resolved Hide resolved

cvat/apps/engine/backup.py Outdated Show resolved Hide resolved

zhiltsov-max commented Sep 27, 2024

View reviewed changes

cvat/apps/dataset_manager/task.py Outdated Show resolved Hide resolved

zhiltsov-max added 5 commits September 27, 2024 19:35

Update changelog

c1eeee2

Reduce code duplication on the same check for validation mode

12a3fe0

Fix some other comments

128c78a

Update tests/python/rest_api/test_jobs.py

cafdeec

Remove extra file

89bb815

Merge branch 'develop' into zm/validation-core

5b39661

bsekachev mentioned this pull request Sep 30, 2024

[Dependend] Reset chunks from cache if oudated #8449

Open

7 tasks

klakhov reviewed Sep 30, 2024

View reviewed changes

zhiltsov-max added 14 commits September 30, 2024 12:05

Fix invalid handling of start, stop frames and frame step in tasks wi…

8451fda

…th honeypots

Update tests

3d9fba2

Fix field access

4b2b2dd

Merge remote-tracking branch 'origin/zm/validation-core' into zm/vali…

2fe8fba

…dation-core

Extract allocation table contents building function

8460d5d

Fix job removal

a1bb881

Fix static chunk creation

a4b8a97

Add more test cases

e9e00b0

Remove unused variable

20d3006

Make frame set check more reliable

cc00426

Make segment type check more robust

b5ab1be

Remove extra db call

23bfc2c

Improve error message

d4bc318

Add field description in the api

57f1d71

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validation API core #8348

Validation API core #8348

zhiltsov-max commented Aug 26, 2024 •

edited

Loading

zhiltsov-max commented Sep 27, 2024

zhiltsov-max commented Sep 27, 2024

coderabbitai bot commented Sep 27, 2024

klakhov Sep 30, 2024

zhiltsov-max Sep 30, 2024

bsekachev Sep 30, 2024

zhiltsov-max Sep 30, 2024 •

edited

Loading

bsekachev Sep 30, 2024

bsekachev Sep 30, 2024

sonarcloud bot commented Sep 30, 2024

Validation API core #8348

Are you sure you want to change the base?

Validation API core #8348

Conversation

zhiltsov-max commented Aug 26, 2024 • edited Loading

Motivation and context

Tasks with Honeypots

How has this been tested?

Checklist

License

Summary by CodeRabbit

zhiltsov-max commented Sep 27, 2024

zhiltsov-max commented Sep 27, 2024

coderabbitai bot commented Sep 27, 2024

klakhov Sep 30, 2024

Choose a reason for hiding this comment

zhiltsov-max Sep 30, 2024

Choose a reason for hiding this comment

bsekachev Sep 30, 2024

Choose a reason for hiding this comment

zhiltsov-max Sep 30, 2024 • edited Loading

Choose a reason for hiding this comment

bsekachev Sep 30, 2024

Choose a reason for hiding this comment

bsekachev Sep 30, 2024

Choose a reason for hiding this comment

sonarcloud bot commented Sep 30, 2024

Quality Gate passed

zhiltsov-max commented Aug 26, 2024 •

edited

Loading

zhiltsov-max Sep 30, 2024 •

edited

Loading